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We review some properties of the convex roof extension, a construction used, e.g., in the definition 
of the entanglement of formation. Especially we consider the use of symmetries of channels and states 
for the construction of the convex roof. As an application we study the entanglement entropy of 
the diagonal map for permutation symmetric real JV = 3 states u(z) and solve the case z < 
where z is the non-diagonal entry in the density matrix. We also report a surprising result about 
the behaviour of the output entropy of the diagonal map for arbitrary dimensions N; showing a 
bifurcation at N = 6. 
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I. INTRODUCTION 

Let <I> : uj i-> u)' be a quantum channel or, somewhat more general, a trace-preserving positive map of (mixed) states 
ui € f2 from one quantum system to states ui' £ f2' from another system. We call 

<N E*(u)= min (1) 

O | entanglement entropy of the channel $ or $ -entanglement for short. Here the minimum is taken over all possible 
convex decompositions of the input state to into pure states 

u= 2_^Pj n i' ttj pure, i.e., tt^- = l^jXVil (2) 

i i and S(uj) = — Tr u> log uj is the von Neumann entropy of the output states. We use the symbol c to denote a convex 
sum, i.e., it implies pj > and ^2pj = 1. 

The quantity appears in different places in quantum information theory. For example, 

1. The celebrated entanglement of formation^ of a bipartite quantum system is the <I>-entanglement of the partial 
trace $ = Tta with respect to one of the subsystems of the bipartite system. 

2. The theorem of Holevo, Schumacher, and Westmoreland [21 E] shows that the one-shot or product state classi- 
\Q cal capacity x(^) °f a channel can be obtained by maximising the difference between output entropy and 
<^> entanglement entropy (the so-called Holevo quantity) over all input density operators: 

<N 

7^ = - £*(w) (3) 

> X$ = max x*M 



X 



3. In [4J the optimization problem Eq. ([IJ was considered in connection with the quantum dynamical entropy 
of Connes-Narnhofer-Thirring[5j. In this framework one considers a subalgebra B C A of the algebra A of 
observables. The restriction of states to this subalgebra gives rise to a channel $b,A> the dual of the inclusion 
map B A. The difference S(^(ui)) — E$,(uj) is called entropy of the subalgebra; see also [5] for a thorough 
presentation. 

Closed formulas for the entanglement entropy, i.e., analytic solutions to the global optimization problem Eq. are 
very rare. They include certain classes of highly symmetric states [7HS] and the celebrated entanglement of formation 
of a pair of qubits[TU]. 

Even earlier, Benatti, Narnhofer and Uhlmann jlj 1111 112) studied the entanglement entropy of the diagonal map 
of a 3-dimensional quantum system as an example for the entropy of a subalgebra. The diagonal map (also called 
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pinching channel) $ /j sets all non-diagonal elements of the input state w to zero and corresponds to the choice of a 
maximal abelian subalgebra B C A. Using a mixture of analytical and numerical methods, they found explicit results 
for the entanglement entropy E$ D (called E D in what follows) of the diagonal map <fr D applied to the one-dimensional 
family of permutation symmetric N = 3 real input states 

uj(z) = l^zlfj. (4) 

In this paper we present some remarks about the role of symmetries in the optimization problem (fTj) based on the 
observations in [7J |Sj. Using those insights we provide new results for the entanglement entropy Eo[z) of states of 
the form Q for the case of negative values of the parameter z. 

We also present a result about the output entropy of the diagonal map in arbitrary dimensions. 



II. CONVEX HULLS AND ROOF EXTENSIONS 



The state space f2 of a quantum mechanical system with an ./V-dimensional Hilbert space % is a compact convex 
space of N 2 — 1 real dimensions. 

A (proper) face F of is a non-empty subset F C f2 which is closed under convex compositions and decompositions, 
i.e., whenever ui — c '^2 i piuii and uj £ F, then u>i G F. The (non-disjoint) union of all faces (J Fi = dfl constitutes 
the boundary of Q. There is a one-to-one correspondence between the faces of f2 and linear subspaces of H with 
an K 2 — 1-dimensional face for every A'-dimensional subspace. The face consists of all the states uj with support in 
the corresponding subspace. Zero-dimensional faces correspond to pure states and constitute the extreme boundary 
d e tt C dtt. 

Let f(ui) be a real-valued function on f2. The convex hull f w of / is the largest convex function not larger than /, 
i.e., for which f m (ui) < f(ui) Vw € fi. The convex hull of a function is the solution of the global optimization problem 

/»= nun C J2Pif("i) (5) 

c y piuji—uj 

where the minimum is taken over all convex decompositions of uj. Caratheodory's theorem asserts that we can restrict 
the search for optimal decompositions to decompositions of length up to l max = dim f2 + 1 . 

Let us now consider the case where the function / is concave, such as the von Neumann entropy S(uj). Obviously 
we can then restrict the search for an optimal decomposition to the extremal boundary, uji € d e £l. It follows that 
the convex hull f w depends only on the values of / on <9 e f2 and not on the behaviour of / inside f2, as long as / is 
everywhere concave. 

Therefore we can consider an extension problem which ist closely related to the global optimization problem pj: 
Given a function g(ir) on d e £l, i.e., on the set of pure states, we ask for a canonical extension g u of g to all of 
defined as 

5 U M= min g(nj) (6) 

. This extension was called convex roof extension and intensively studied in, e.g., |13l 114] . It is, in a sense, the 
extension which is as linear as possible while being everywhere convex. 

Definition II. 1 (roof extension). A function G(u>) is called a roof extension of 17(71-) if for every us E £1 there is at 
least one extremal convex decomposition 

uj = y ^PjiTj, WjEd e Q (7) 
such that G(ui) = y,Pj9( n j)- (8) 

If this is the case, we call the decomposition ^ optimal with respect to g or ^-optimal. 

Fig. [T] may illustrate the concept and explain the name. In a roof extension, the ground floor is covered by 
straight roof beams and plane tiles. Those beams and tiles rest with their ends on the the wall erected by g(ir). It is 
immediately clear from the definition of convexity that every convex extension is pointwise majorized by every roof 
extension. But is the largest convex extension a roof? The following theorem asserts that this is true at least when g 
is continuous and fl compact: 
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g(ji) convex extension < roof extension largest convex ext. = smallest roof 

ext. = convex roof 

FIG. 1: An illustration of the convex roof extension where Q is a disc and d e O. a circle. 



Theorem II. 1 ([HE2)- Let g(ir) be a continuous real-valued function on the set of pure states d e fl. There exists 
exactly one function <7 u (w) on f2 which can be characterized uniquely by each one of the following four properties: 

1. g u is the unique convex roof extension of g. 

2. ,g u (w) is the solution of the optimization problem 

<?» = inf C ^> j5 (^). (9) 

3. g u (co) is largest convex extension as well as the smallest roof extension of g. 

Furthermore, given uj £ Q, the function g u is convex-linear on the convex hull of all pure states n appearing in optimal 
decompositions of uj. 

Therefore, g u provides a foliation of into compact leaves such that a) each leaf is the convex hull of some pure 
states and b) g u is convex-linear on each leaf. 

Remark II. 1. 1. The theorem justifies to write "min" instead of "inf" in Eqs. and 

2. If g u is not only linear but even constant on each leaf, it is called a flat roof. 

3. Let f(ui) be a concave function on f2, e.g., f(ui) = S(Q(uj)). Then we denote by / u the convex roof extension 
of Jkn. 

III. SYMMETRIES AND INVARIANT STATES 

The following lemma gives a simple bound for / u . Let P : ft — > S be an affine and surjective map uj i— > £. The 
space S, as the image of £1 under an affine map, is convex and compact, but it need not be a quantum state space. 
The map P provides a foliation of f2 into leaves O = U^e" ^£ v ^ a 

L^ = {uj\ujen,P(uj)=^}. (10) 

Since P is affine, every leaf is generated by cutting f2 with some hyperplane and therefore, the leaves are convex, too. 
We define the function e(£) on S as the minimum of / on the corresponding leaf 

e(0 - min/M = min{/(w) | Puj = £}. (11) 
Lemma III.l. The convex hull of the e function, e w (t;), provides a lower bound for the convex roof f u (co), i.e., 



/»>e M (P W ) 



(12) 
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Proof. Let uj — ^ ^i^i be optimal for / u , so f u (uj) = \f{&i)- Let & = Pwj. Then, due to linearity of P we have 
£, = and due to the definition of e(£) we have fioJi) > e(£j). So, 

/ u H = ^A l /(^)>^A l e(6) (13) 

and from eq. we have 

E^i)>A6. (14) 

□ 

There are some cases where we can find states uj for which the inequality of the lemma can be sharpened to an 
equality. 

Theorem III. 2. Let P = P 2 be a linear and idempotent map of the state space onto itself with an fixed point set 
PQ, = 3cf! of P -invariant states. Let f{ui) be P '-invariant^. e. , 

f(Pu>) = f(u) e (15) 

Then for all P '-invariant states uip £ Pf2 holds 

/> P ) = £> P ) = e> P ) (16) 

and these states have an optimal decomposition completely in PQ, i.e., into P -invariant states only. Furthermore, 
for every state u) it holds that 

/»>/ u (P W ) (17) 

Proof. Since / is P-invariant, it is constant on every affine subspace P~ l ui. Therefore, we have e(ui) — f{ui) on Pfl 
and e is concave. So, e^(uj) = f u (ui). Let uj = c ^ ^j^j be optimal for e u . Then, 

e » = XVK) = T WK) > /»■ (is) 



Together with Lemma III. 1 this proves ([16| and provides an / -optimal decomposition lying in Pf2. □ 

Example III.l. Let P be the projection to real states P(ui) = l/2(w + uj t ) and / the output entropy of the 
diagonal map S® ut (uj) = S($d(u)). Then real states have optimal decompositions into real states only. Furthermore, 
Lemma [III. 1 1 asserts that all non-real states have an entanglement entropy at least as large as their real projections 

E D (u) = / U H > e u (Pw) = E d {Puj) Vwefi (19) 

A slightly different version was used in [7] and worked out in |S]: 

Theorem III. 3 ( [3 IS] ) - Let G be a symmetry goup of f such that f(ui 9 ) = f{uj) for all uj G fi and g G G. Let Pq be 

the twirl map or group average corresponding to G, i.e., the idempotent projection to the subspace PqQ of G -invariant 
states 

u P = P g uj = jL " 9 (20) 
Then for all G -invariant states upG Pg^ holds 

/>p) = e> P ). (21) 

Furthermore, for every state uj G f2 it holds that 

/»>., U (P W ) (22) 

States ujp for which e w (ujp) = e(wp) have an optimal decomposition consisting of one complete orbit of G; otherwise 
the optimal decomposition consists of several complete orbits. 
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Proof. We assume that ujp — J^^i^i is optimal for e m (ujp). Let Qi be states which achieve the minimum in eq. (Ill 
for the we e(u>i) = f{£ji) and uji belongs to the leaf Li. So, Pu>i = Ui and therefore 



UJp = 



E 



A 



1 r,s 



\G\ 



is a candidate decomposition for /. So, 



With f(co 9 ) = /(w) and J2 g t^t = X, the right hand side evaluates to 



(23) 



(24) 



(25) 



This, together with lemma III. 1 proves the theorem and shows that decomposition (23) is optimal for / L 



□ 



Please note that the G-invariance of / does not imply Pg-invariance of /. This is the main difference between 
Theorem |III.2| and Theorem |III.3| for applications. 

Only in the case where /(w) is P^-invariant /(Pw) = /(w) (which implies G-invariance f(co 9 ) = f(u>)) we know 
that every G-invariant state has an optimal decomposition consisting solely of G-invariant states. 



IV. OUTPUT ENTROPY OF THE DIAGONAL MAP 



The diagonal map <!>£> maps f2jv, the state space of an ./V-dimensional Hilbert space, to the simplex 
£l' N — {xi, X2, Xn}> < Xj < 1, Y] xj = 1. It corresponds to a complete von Neumann measurement. Its 
Kraus form is 

N 

$(&o = p ^ ( 26 ) 
i=i 

with Pi — The output entropy of this channel is 

Sf ut ( W ) = S($ D (u>)) (27) 

N 

= E^) ( 28 ) 

i=l 

with the usual abbreviation rj(x) = —xlog(x) for x > and rj{0) = 0. This function is not only concave but a concave 
roof, as was shown in |14j . 

The minimal output entropy is zero and the maximal one is log N. 

Things become more refined by restricting the channel onto a face of SIn. As an example we take the (JV — 1)- 
dimcnsional subspace Ho which is orthogonal to the vector \<f>) = iV -1 / 2 ^ It consists of vectors X) a j'b) such 
that ^2 a j — 0- %o supports pure states satisfying <i>£>(7r) — N^ 1 ! and so the maximal output entropy is logiV again. 

For the minimal output entropy we have a more complex result: 

Theorem IV. 1. Let flo be the face offlj^ consisting of states whose support is orthogonal to Xa=i K)- Let S® in be 
the minimal output entropy of the diagonal map Then we have: 

• We have S^ in (Clo) = log 2 for For N — 2, 3,... 6. This is achieved by the N(N — l)/2 pure input states 
ftjk, j < k, and only by these states. Here, njk = \4>jk){4'jk\ with \4> nm ) = 2 -1 / 2 Qn) — \m)). 

• For N > 6 we have 

S£jn ) = logiV - ( 1 - log(7V - 1) (29) 



N , 

and limjv->oo Smin = 0- The minimum is achieved by the N states ttj = \4>j)(<f>j\ where 

cj> 1 = ((N-l)a,-a,-a,...,-a) with a = (N(N - l))~ 1/2 (30) 
and the other <pi are obtained by permuting the components. 
The proof of this theorem is found in the appendix. 
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V. ENTANGLEMENT ENTROPY OF THE TV = 3 DIAGONAL MAP FOR SOME SUBSETS OF STATES 



A. Geometry of the N = 3 state space 



The space 3 of positive hermitean 3x3 matrices with unit trace has 8 real dimensions. Its boundary consists 
of zero-dimensional faces (pure states) and three-dimensional faces (Bloch balls), the latter corresponding to two- 
dimensional subspaces of the Hilbert space % = C 3 . 

We will use the notion ip = (a, b, c) to denote a one-dimensional subspace [Ct/j] of C 3 as well as the corresponding 
point 7r^ = IV'KV'I °f ^3- Here, (a, 6, c) is a generally unnormalized element of this one-dimensional subspace. The 
set of states orthogonal to a given pure state ip form a Bloch ball which we denote by B J -('0): 

B x (V0:={a,|Mu#)=O} (31) 

and all non-trivial faces of O3 are obtained in this way: There is a Bloch ball opposite to each pure state and this 
gives a bijection between the 0- and 3-dimensional faces of f2. 

More generally, we can consider for every pure state ijj the foliation of f2 3 by parallel hyperplanes (F) defined as 

£. (F) :={w| = F} (32) 

where F is the fidelity parameter. The leaves are 7-dimensional in the generic case, but the highest leaf L^(l) = tt^, 
consist of one pure state only and the lowest leaf L,/,(0) = B^^) is the Bloch ball opposite to tp. Furthermore, every 
basis {ipi, i>2> ^3} °f three orthogonal pure states spans an equilateral triangle in f2 3 . Every edge ipi^j of this triangle 
is the diameter of the Bloch ball B ± (i};k) orthogonal to the opposite vertex, see Fig. m All these triangles have the 




FIG. 2: Bloch ball face and opposite face. Note that the plane of the triangle and the ball have only a one-dimensional 
intersection, the diameter ip2ip3 



barycenter 1/3 of £1 in common. 



B. States of lowest entanglement entropy 



The triangle spanned by the computational basis ^o^,^i^,^o^ is the lowest leaf of the roof of Er> = 5^ , the 
leaf where Ed = 0. Especially, £7(1/3) — 0. This triangle is the fixed point set of the diagonal channel <f>c- 



C. Some rank-2 states 



We can calculate Ed for every Bloch ball which includes one of the three states of the computational basis. Take, 
for example, B(tp , ipab), the ball spanned by V'o = (1, 0, 0) and some orthogonal state V'afc = (0, a, b). This ball is the 
image of the unitary embedding V : ( J ) H> ^ j , ( ? ) i-> of a standard Bloch ball. This embedding can be used 

to reduce the calculation of the convex roof to the N = 2 case, see chapter 6.2 in [TJ]. Using the known results for 
the N = 2 diagonal map (see, e.g., [T^]) we find for states from this Bloch ball, i.e., states of the form 



(1 — z xa xb \ 
x*a* z a* a z a*b 
x*b* zab* zb*b) 



(33) 
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with real z and complex x, a, b the entanglement entropy 

1 + A\ /1-A 



Ed = rj I — - — I + r\ f — - — \ + z rj(a* a) + z r](b* b) where A = VT— Ax*x. (34) 



D. Real permutation invariant states 



The permutation group G = S3 acts on f2 3 by permuting the computational basis / ( j , f 1 ^ , ( j | . The corre- 
sponding twirl acts on normalized pure states ip = (a, b, c) as 



'1 z z^ 
z 1 z 
z z 1, 



P G (7ty) = - ( z 1 z| where 2z = ab* + a*b + ac* + a*c + be* + b*c. (35) 



Let us denote the S3-invariant state on the right hand side as ui(z). In what follows, we restrict our considerations to 
real states w = oj t . Then the real parameter z can take values in the range 

-\<z<l. (36) 

Another often used parametrization for these states uses the fidelity with respect to the state u>(l) = tt(1, 1, 1). We 
have 

^=(^(1,1,1)1^1^(1,1,1)) = ^(2«+l). (37) 

The state lu(z) is of rank three except at the boundaries of the z range: 

• For z — 1 we have a pure state 

w(l)=7r(l,l,l) (38) 

where we use n(a,b,c) as shorthand for the pure state tt^ = \ip) (ip\ with ip ~ (a, b, c). So its entanglement 
entropy equals its output entropy and we have 

£k(l)=log(3). (39) 

• For z = — !/2 we have a rank-2 state 

w(-Va) = I W-1,0, 1) + 7r(-l, 2, -1)] . (40) 



This state belongs to the face of considered in Section |IV| so its entanglement entropy can't be smaller than 
log(2). It is easy to see that this value can indeed be achieved by the optimal decomposition 

u (-1/2) = I [7r(l, 0, -1) + tt(0, -1, 1) + 4-1, 1,0)] , (41) 

and therefore 

E D (-1/2) - log(2) (42) 

• Let's also mention that for z = we have the maximally mixed state which belongs to the lowest leaf of 
Section |VB| So an optimal decomposition is 

w(0) = i [7r(l, 0, 0) + tt(0, 1,0)+ tt(0, 0, 1)] , (43) 

^d(0) - 0. (44) 
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Applying Theorem III. 2 using the projection Poj = |(u; + w T ) we see that the states oj(z) have optimal decompo- 
sitions including only real states. Furthermore, for an arbitrary state we have 



Bd(w) > E D 



Applying now Theorem |III.3| with the projection Pq to the space of real states we learn that 

Ed{z) = e (z) where e(z) = min Sd (iv(a,b,c)) , with z, a, b, c real 



The minimization in Eq. ( 46 1 is one-dimensional since the three real parameters are constrained by a 2 
ab + bc+ ac = z. A useful paramctrization of this constraint is |4J 

3 (a, b, c) = ( a + 2(3 cos 9, a — 2(3 cos( 



(45) 



(46) 



c 2 = l, 



tt/3), a-2/3cos(6> + 7r/3)) where 



y/2z+l, (3 = y/T^z (47) 



Numerical search for the minimum in Eq. (46 1 shows that the minimum is reached at 9 = for all z > —0.4150234. 
For smaller values of z, 9 m [ n increase up to m j n = */6 at z = — A thorough analysis of the function e(z) obtained 
by this minimization shows that it is not everywhere convex. In the region z > we re-obtained the result of [12] : 
the convex hull e n is obtained by replacing e(z) in the region 5 /6 < z < 1 with a linear piece. 

In the negative- z region our results differ from |12j . who claimed that e(z) is convex there. We find that the convex 
hull is obtained by replacing e(z) in the region between z — — 1 /2 and z — z* — —0.4079496711 with a linear piece, 
see Fig. [3] 




FIG. 3: Entanglement entropy and optimal decompositions. Regions where the convex hull construction leads to a linear 
behaviour are drawn in red. The drawings indicate the shape of optimal decompositions of the red state into pure (blue) states 
in the three regions z < z* , z* < z < 5 /6, 5 /6 < z, resp. Here, gray surfaces indicate hyperplanes of states which project under 
Pa to the same value of z. The corners of the triangles form a complete orbit under the permutation group G = S3. 

Interestingly, everywhere in the region z* < z < 5 /6 where e(z) = e w (z) the minimum is obtained for states with 
9 = 0. So the optimal decompositions in this region have the form 



1 



U) 



7r(o, 6, b) + 7r(6, b, a) + tt(6, a, b)] 



(48) 



corresponding to a short orbit of S3 of length 3 only. In the region — 1 /2 < z < z* the optimal decomposition has 
length 6 and is a mixture of two such short orbits resulting in a large region in state space where the entanglement 
entropy is an affine function. 
With 



S(z) = 2i] 



(a-(3)< 



{a + 2/3)' 
9 



where a — \/2z + 1, (3 = y/l — 



(49) 
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and z* — —0.4079496711, S( 5 /e) = log(3) — 1 /3log(2) the final result for the entanglement entropy is therefore 

( plog(2) + (1 - p)S(z*) for - 1/2 < z < z* with p = -f^- 2 

E D (z) = I S(z) for z* < z < 5/e (50) 

(p[log(3) - V 3l °g( 2 )] + 0--P) lo g( 3 ) for 5 /e<z<l with p = ^st 6 
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Appendix A: Proof of theorem |IV.1| 

1. We have S® ut (oj) = S^ ut (uj T ) = S^ nt (^(uj + w T )). Therefore we can restrict our search for the minimum to the 
subspace Or = {w | w = us T } of real states. 

2. The minimal output entropy is attained by pure states since S® nt is concave and f2j> is convex. 

3. The case N = 2 is trivial. There is only one pure real state 0i2 = <f>i — 2~ 1//2 (1,— 1) in Hq with S® ut = log 2. 
So we now assume N > 3. 

4. The pure real states in have the form 7r = IV'XV'I with 

\tp) = y~] aj\j), 5Z a j — 0' 5Z a i = -^ which implies a 4 e(— 1,1). (Al) 
So we use Lagrange's multiplier method to find the minimum of 

S» t (n) = -J2*>&°?j (A2) 



for all at satisfying eq. ( Al I. The equations to solve read 

a* log af + a, = A + a,// (A3) 
or: aj log a 2 = A + asj/i (A4) 



where A,// denote Lagrange multipliers for the constraints Eq. (Al) and fi = // — 1. Multiplying eq. (A4) by 
dj and summing over j yields 



S'out = — fi and therefore, fi < 0. (A5) 



Not all solutions of eq. (A4| have minimal output entropy but all states of minimal entropy must be solutions 



of eq. ( A4 1 . So can find the minimum by classifying all solutions and comparing their entropy. Let us consider 



different cases: 

(a) A = 0, so the solutions of eq. ( |A4| are a* S {0, ± exp(/x/2)}. Let m instances of the a, be nonzero. Then 
their modulus must be m -1 / 2 for ^ a 2 = 1 and S® ut = —fj, = logm. Since m must be even for ^2 a,j = 0, 
the minimum value for Sd is achieved for m = 2. So one candidate for the minimum of S® ut is 

3? ut = log2, \4> jk ) = 2- 1 / 2 (\j)-\k)) (A6) 

(b) A ^ 0. Then all the Oj have to be non-zero. The transcendental equation A + x/i = xlogx 2 can be rewritten 

as 

^e~ = = ^j~| e ^ = ±we w with uj = X/2x (A7) 

The inverse of the function f(w) = we w is the Lambert W function W(z)^£]i defined via 

z = W{x)e w{z) . (A8) 
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As an inverse of a non-injective function it has multiple branches, two of which are real and denoted as W_i 
and Wq . It follows that A + fix = x log x 2 has no more than three real solutions which can be expressed as 



Xi 



■i'2 



X3 



A 

2W (z) 
A 

2W (-z) 
A 

2W_i(-z) 



where z = — e 
2 



(A9) 
(A10) 
(All) 



Since A 7^ we have z > 0. Then a solution xi does always exist and the solutions xi and X3 exist only if 
z < 1/e. They are equal for z = 1/e. 

i. Let us assume that only two of the values, say x\ and xi are used in the state. So we have 



nx\ + mx2 = 0, no: 2 + mx 2 = 1, n + m = N, n, m > 1 



resulting in x 2 



iJV> ^2 



and so 

<nN 



S(N,n) = logiV 



Now this expression is concave in n: 

d 2 S(N,m) 
dn 2 



N 2 



n 2 (N -nf 



< for < n < N 



(A12) 



(A13) 



(A14) 



and therefore takes for fixed N its minimum at the edges of the allowed n-range, n — 1 or, equivalently, 
n = N-l. 

So the second possibility for a minimum of S® ut is 



5 ^ ut = log N - ( 1 - - ) log (N - 1) , {a,} = (a, a, a, (1 - N)a) 



(A15) 



ii. The last possibility is that all three roots xi,X2,X3 occur among the aj. Consider the function 

F{\,ii)=x\+xl + xl (A16) 
where the Xi are the three solutions of A + fix = xlogx 2 . Using Eqs. ( A8|A9|A10|A11 ) we find 

F(X, n) = e» (e 2W ^ + e 2W °^ + e 2W -^) (A17) 
= e M G(A, ^u) (A18) 
where G(A, ,1) = e 2W °« + e 21 ^ 2 ) + e 2W ~^ (A19) 

Now we have (7(0) = 2 and G'(z) > for z G (0, 1/e] (see Fig. H, therefore G(z) > 2 for z G (0, 1/e]. 



0.05 0.10 0.1 



0.20 0.25 0.30 0.35 



FIG. 4: Plot of G'(z) over z 

Since we need F < 1 for a normalized state vector, this implies e M < i, fi < — log 2, S^ L = —fj, > log 2 
and therefore any such solution has larger output entropy than the state given by Eq. (A6l. 
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FIG. 5: The output entropies log 2 and Eq. |A15| plotted over N 



5. The only thing left to do is to compare the two candidates for a minimum, Eqs. ( A15 1 and ( |A6| . It is easy to 
see, that candidate (A6l wins for N < 6 and candidate (A15) wins for all N > 6, see Fig. [5} □ 
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